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This is an attempt to appreciate the science which has been recognized and rewarded with 
Nobel prizes in Chemistry, Physics and Medicine for the year 2024, an annual effort on my 
part for the past many years. 


We begin with the Nobel Prize in physiology or medicine for the year 2024. Victor Ambros 
and Gary Ruvkun win this year's prize for the discovery of microRNA and its role in 
post-transcriptional gene regulation. 


Victor Ambros Gary Ruvkun 


The complexity of the molecular processes in even the simplest of living organisms is 
astounding. All this is defined in genes, and the information coded in DNA is first transcribed 
into RNA and then translated into proteins. RNA as the intermediary is absolutely essential. 
DNA as everyone knows is a double helix and the two strands are complementary. That is 
how cell division takes place. Once the two strands are separated, molecules in the 
neighbourhood rapidly attach themselves to each of the strands and two copies of DNA are 
obtained. But this prevents “reading” the information in genes and using it to make proteins. 
So the RNA which will not normally form a double helix is needed. Proteins separate the 
strands of the DNA, make an RNA copy of the specific gene required and make a protein 
like for example insulin, required for extracting the energy from a molecule of sugar. Even a 
very small change in the protein structure during any of these steps will adversely affect the 
catalysis of the cellular chemical reaction. Insulin has to be produced in the pancreas and 
not in any other cell of the body. But the gene for producing insulin is present in each and 
every cell of the body. Every cell has the entire DNA. Obviously there must have been strict 
control of which genes are “expressed” as proteins and to ensure that randomness at the 
atomic and molecular levels is severely curtailed during the transcription and translation. By 
the time Ambrose and Ruvkun began their investigations, several mechanisms which control 
transcription and translation were well known. 


Their work began as routine investigation of mutant C. elegans nematodes with 
developmental defects caused by alterations at the lin-4 and lin-14 genetic loci. Model 


organisms like this nematode worm have been extensively investigated in genetic research 
and yielded many unexpected discoveries. Sydney Brenner 

introduced the nematode worm Caenorhabditis elegans (C. elegans) over five decades ago. 
The organism reproduces fast and genetic manipulation is easy. Sydney Brenner, John 
Sulston and Robert Horvitz used it understand how cell division, differentiation, and cell 
death are genetically controlled. For these discoveries they were awarded the 2002 Nobel 
Prize in Physiology or Medicine. 


Mutant lin-4 worms accumulate internal eggs without forming a vulva, while lin-14 mutants 
are small and lack larval development. Several important observations were made by 
Ambros and Ruvkun. (1) There are several variants of the worm with lin-14 mutation. (2) A 
second mutation in the end of the RNA, the part which is not translated into protein, causes 
increased production of the mutant lin-14 protein. (3) There is some inverse correlation 
between the development dynamics of the lin-4 and lin-14 mutations. (4) There is only one 
mutant lin-4 and the mutation causes very large changes in the organism, suggesting that 
this mutation is affecting the control of translation and transcription in the organism. The 
results on lin-4 and lin-14 were very intriguing. 


Ambros and Ruvkun continued their diligent investigations on these mutant worms. Ambros 
and his colleagues concentrated on cloning the lin-4 gene and determining the sequence of 
the RNA. The result was extremely surprising. The gene coded for an extremely short, non 
coding 22 nucleotide sequence. Simultaneously Ruvkun’s lab determined the RNA 
sequence for the lin-14. 
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Complementarity of the microRNA and the RNA 3’UTR 


The two collaborators simultaneously realised that there was a partial complementarity 
between the 22 nucleotide sequence of lin-4 and the noncoding trailing end of the lin-14 
RNA. All mRNA molecules have these noncoding ends. The RNA starts off with 5’°UTR which 
binds to the ribosome molecules to start translation. The trailing or 3’°UTR is usually a signal 
for ending translation and release of the protein by the ribosomes. Clearly the lin-4 RNA 
sequence is binding to the 3’UTR of the lin-14 RNA due to the complementarity. This 
explains why the second mutation causes increased protein production as mentioned above. 


Lin4 is not complementary to the RNA end with the second mutation. At this stage it was not 
known if this interference was a quirk of the particular organism. The significance of this 
result was not immediately recognised. So much so Ambros was denied tenure (permanent 
professorship) in Harvard University and he relocated to Dartmouth College immediately 
after this work was completed. 


The second example of mutation due to microRNA was discovered only 7 years later. But 
the identification of let-7 by Ruvkun and co-workers triggered huge interest in microRNA 
because it was shown to be conserved in organisms as diverse as fruit flies and humans. 
This was a clue that microRNAs are an important part of gene regulation in all living 
organisms. 
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Tree of life showing organisms with conserved microRNA 


Later work has confirmed this. For example, as of now, 90 families of miRNAs conserved 
at least since the common ancestor of mammals and fish are known. Studies on 
mice in which genes for one or more members of a microRNA family have been 
knocked out have shown that these microRNAs play extremely important roles in cell 
physiology. Binding of microRNAs to mRNAs lead to their silencing by cleaving, 
destabilising or preventing their transcription. However, the specific steps in these 
activities vary significantly from case to case and are also extremely complex. The 
microRNAs fold on themselves to form hairpins. And even these are shown to 
participate actively in cellular processes. Just as microRNA is involved in the normal 
functioning of eukaryotic cells, so has dysregulation of microRNA been associated 
with disease. The most important current example of a disease linked to 
dysregulation of microRNA is chronic lymphocytic leukaemia. Obviously, microRNAs 
have the potential to be used as tools or targets for treatment of different cancers. 


Because of these developments, the original work of Ambros and Ruvkun has been 
recognised as extremely important and they have shared a number of awards before 
this year’s Nobel Prize. These include the 2004 Rosenstiel Award, the 2008 Warren 
Triennial Prize, the 2008 Gairdner Foundation International Award, the 2008 


Benjamin Franklin Medal in Life Science, the 2008 Lasker Foundation Award for 
Basic Medical Research, the 2009 Louisa Gross Horwitz Prize, the 2009 Massry 
Prize, the 2012 Dr. Paul Janssen Award for Biomedical Research, the 2014 Wolf 
Prize for Medicine, the 2015 Breakthrough Prize in Life Sciences, and the 2016 
March of Dimes Prize in Developmental Biology. 


The award of this Nobel prize has created a lot of comments about the gender bias 
of the committee that awarded the Nobel Prize since Rosalind Lee, wife of Victor 
Ambros, was the first author of the breakthrough paper in 1993. But she did not 
share any of the earlier awards either. She joined her husband as an assistant 11 
years after her bachelor's degree and has always worked with him. Rather than 
accuse everyone involved in every selection in the above list, of gender bias, one 
has to perhaps accept that Rosalind Lee always opted to avoid the limelight and stay 
in the shadow of her husband of 48 years. 


The Nobel Prize in Physics, for the year 2024 has been awarded jointly to John J. Hopfield 
and Geoffrey E. Hinton “for foundational discoveries and inventions that enable machine 
learning with artificial neural networks”. This year's awards in physics and chemistry are very 
closely related. The work recognised for the physics award has been the basis for the work 
rewarded with the chemistry prize. Recognising work on biomolecules and biochemistry with 
nobel prizes in chemistry has been a regular feature. But to see biology being mentioned in 
a physics award is as far as | know unique. So is the award of a Nobel prize in physics to a 
person with the bachelor's degree in experimental psychology and a doctorate in cognitive 
sciences. In addition, both awards are, as will become obvious, really for mathematics and 
computer programming. 


John. J. Hopfield. Geoffrey E Hinton 


One suspects that the decision to award the chemistry prize to the developers of computer 
programs based on artificial neural networks triggered this choice to award a prize to the 
pioneers of artificial neural networks. The Nobel committee stated that the laureates used 
physics to find patterns in information to justify this. But as will be seen, the link between the 
physics of magnetism and the design of artificial neural networks is metaphorical and 
extremely tenuous. 


A neuron has a nucleus, cell wall etc like any other cells but also has unique parts 
called the axon, dendrites and synaptic terminals. It releases neurochemicals at the 


synaptic terminals when it fires. This firing takes place when the cumulative signal 
from all its dendrites crosses a threshold. The dendrites sense the neurochemicals 
released by the firing of other neurons and open ion channels to either increase or 
decrease ion concentration inside the cell. When this ion concentration crosses a 
threshold, an electric signal passes along the axon and triggers the release of 
neurochemicals near the synaptic terminals. Memory formation in a neural network has 
been described more than 75 years ago as the strengthening of the synaptic connection 
between two neurons due to repeated activation of the particular synaptic connection. 
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In the language of computer science, the firing of a neuron is a digital signal produced when 
the weighted averages of the digital signals produced by other neurons crosses a threshold. 
The first attempt to implement an artificial neural network on this basis was undertaken in 
1957. A machine, Perceptron mark | was built and successfully demonstrated. The working 
was also simulated on the then state of the art IBM mainframe computer. 

It had a single layer of artificial neurons taking its input from an array (20x20) detectors. 
Weights were manually adjusted with rheostats. It created a lot of interest, but it was pointed 
out shortly afterwards that a single layer neural network of this type is of very limited utility. 
The IBM mainframe of that era used vacuum tubes and could at best perform a few 
thousand arithmetic operations per second contributing to limiting the effort to a single layer. 
The 20x20 detector array was equally incapable of being scaled up. 


The work by Hopfield in the early 80s set up a strong mathematical foundation for the entire 
field of artificial neural networks. By this time, even a desktop computer could perform 
millions of mathematical operations per second. So implementing multi layer artificial neural 
networks and avoiding the limitations of the Perceptron era had become possible. 


John J Hopfield is a distinguished physicist who made significant contributions in many 
areas of theoretical physics including excitons, polaritons, Kondo effect, and 
pseudopotentials. He also introduced a mechanism for error correction in biochemical 
reactions called kinetic proofreading. 


The spins in a ferromagnet interact through the exchange coupling which could make either 
the two spins being aligned parallel (the ferromagnetic case) or ant-parallel (the 
antiferromagnetic case) the lower energy state. Thermal energy energy would disturb this 
alignment. At low enough temperatures the spins will be aligned while at high enough 
temperatures there would be no alignment and the spins will point in random directions. The 


mean field theory is an approximation in which the coupling between a given spin and all the 
others is represented by a mean field at each spin. When there is only one type of coupling, 
there will be one and only one ground state with the minimum value of energy, the one in 
which all the spins are aligned in the ferro (or antiferro) magnetic order. In 1975, David 
Sherrington and Scott Kirkpatrick explored the consequences of the presence of 
simultaneous random ferro and antiferromagnetic coupling. The coupling between any two 


spins could be either ferro or antiferromagnetic but symmetric (the exchange from “i” to “j’ is 


the same as that from ‘j” to “i’). They showed that there are an infinite number of local 
energy minima and labelled the phase a spin glass. 


Hopfield realised that multiple minima in energy are not contingent on the presence of the 
two types of exchange couplings. The mathematical equations would lead to multiple 
minima in a wide variety of situations. He used this insight to answer a most important 
question in biology. Are the stability of memories, construction of categories of 
generalisations and time-sequential memory, emergent properties of the entire neural 
network and collective in origin just as vortices in liquid flow or ferromagnetic domains, which 
are not properties of individual atoms? Since real life neural networks are extremely complex 
and poorly understood, he sought to develop a mathematical model in which the collective 
properties do not depend on the specific details. This is a reasonable requirement. On an 
average, a neuron connects to about 7000 other neurons in the human brain. 


He recast the two equations defining mean field magnetic theory to define a neural network 
of N binary nodes. 
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The first of these converts the description of the working of a neuron described above into 
mathematics. The weight W; determines how much the neurochemical released by neuron 
“!” influences neuron “j’. The second equation is more subtle. It represents the correlation 
between the two neurons, the probability that both are firing or both quiescent. As Hopfield 
observed, this case is isomorphic with an Ising model in which the weight is replaced by the 
exchange coupling between spins. The work on spin glasses focussed on an infinitely large 
number of spins. Hopfield numerically solved the equations for a small number of nodes. It 
thus became the model of a general content addressable memory, now called a Hopfield 
network, that would be capable of retrieving the entire memory on the basis of sufficient but 
partial information. Hopfield showed that if the weights in the network are adjusted initially to 
a minimum value of a “fictitious energy” for a specific input of information (This is the process 
of creating a memory or learning) and subsequently interrogated, the system will dynamically 
move to the minimum, even with partial information. 


Obviously the similarity between magnetism and artificial neural networks is extremely 
feeble. In the case of magnets the strength of the exchange between two spins depends 
only on the distance separating the two spins. In a neural network the strength depends on 
the information stored. And the memory capacity depends on the number of neurons as well 
as their organisation. The so called energy in the artificial neural network is fictitious and 
there is no “information” in magnets or spin glass phases. 


Hopfield in the seminal publication identified several reasons for the limitations of the earlier 
mathematical models of artificial neural networks (and the Perceptron). The introduction of 
feedback from upper layers to all the lower layers is the first of these. The absence of a 
master clock that ensures simultaneous change of state of all neurons is a second. (A 
master clock is the legacy of thinking in the language of ordinary digital computers). The use 
of a step function in the place of a linear input output relation as in the Perceptron is another. 
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Feed forward network with one hidden layer 
Later work has shown that the lack of synchronisation is perhaps the most important reason. 
Multilayer networks without feedback, called the feed forward networks, are extensively 
used for some applications. These have one or more hidden layers. While the artificial 
neural networks being asynchronous seems to bring them closer to real neural organisation, 
it should be emphasised that the artificial neural networks are purely mathematical 
structures. 


Building on this work, Hopfield developed a mathematical model where the input output 
relationship is continuous rather than a step. This enabled handling several optimization 
problems like the travelling salesman problem using this continuous neural networks. While 
an answer could be obtained, it was not possible in finite time to demonstrate that it was the 
best answer. 


Hopfield network Boltzmann Machine Restricted Boltzmann 
Machine 


Hopfield network has no hidden nodes. Training is with individual vectors. The Boltzman 


machine of Hinton has hidden nodes and the training is with a distribution of vectors. The 
restricted Boltzmann machine has no links between input nodes and hidden nodes. 


The Hopfield network was of limited use partly because the memory capacity was limited. 
Only about 138 vectors could be stored in a network of 1000 nodes. Further, the memory 
required scaled linearly with the number of input features. Modified or dense Hopfield 
networks have been developed to overcome these limitations. In these the functional form of 
the “fictitious energy term” is completely different from the original one borrowed from 
physics. 


Geoffrey Hinton and coworkers developed a stochastic extension of Hopfield’s model, called 
the Boltzmann machine. Here, each state s=(s1,...,sN) of the network is assigned a 
probability given by the Boltzmann distribution. 
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As before, T is a fictitious temperature and the second term is a bias field. Unlike a Hopfield 
network, where a single training vector is compared to a single test output, in a Boltzman 
machine, the statistical distributions of the training and testing patterns are compared. Hinton 
and his colleagues developed gradient-based learning algorithms for parameter 
determination. Once again, the Boltzman machine was of limited use and Hinton developed 
the restricted Boltzmann machine (RBM) which has become a versatile tool. Modifications 
and improvements of artificial neural networks are not selected by physics based justification 
but on the utility. 


The real contribution of Hinton to the area of artificial neural networks is the development of 
algorithms for their practical use. An RBM network has weights only between visible and 
hidden nodes, and no weights connect two nodes of the same type. For RBMs, Hinton 
created an efficient approximate and extremely fast learning algorithm. He then developed a 
pretraining procedure for multilayer networks, in which the layers are trained one by one 
using an RBM. After pre-training, it became possible to perform a global parameter 
finetuning using the backpropagation algorithm. The pre-training with RBMs picked up 
structures in data, such as corners in images, without using labelled training data. Having 
found these structures, labelling those by backpropagation turned out to be a relatively 
simple task. By linking layers pre-trained in this way, Hinton was able to successfully 
implement examples of deep and dense networks, a milestone toward what is now known as 
deep learning. 


The claim that physics has been a driving force underlying inventions and development of 
ANNs is questionable. But the contributions of ANNs to science, medicine and everyday life 
are unquestionably significant. Deep learning architectures are trained to reproduce energies 
of phases of materials, as well as the shape and strength of interatomic forces, with an 
accuracy comparable to ab initio quantum-mechanical models. These ANN trained atomic 
models are used for a faster determination of phase stabilities and the dynamics of new 
materials. ANN have provided higher resolutions in physics-based climate models. ANNs 
have been trained to pick out specific patterns in the large volumes of detector data being 
generated at a high rate helping in the identification of the Higgs Boson, the TOP quark, in 
producing the neutrino image of the milky way as well as the image of the blackhole at the 
centre of the galaxies. ANNs have been demonstrated to be superior in detecting breast 
cancer cases among routine scans and correcting for movement in MRI scans. Today, ANNs 


are behind many things we do in everyday life such as image recognition, language 
generation and internet surfing. 


The Nobel Prize in Chemistry for the year 2024 has been awarded to David Baker, Demis 
Hassabis and John Jumper for computational protein design and protein structure prediction. 


John Jumper 


David Baker. 


Proteins are long polymeric chains of amino acids. Genes specify the sequence of amino 
acids or the primary structure of the protein. This chain folds successively into the 
secondary, tertiary and in some cases quaternary structures. Proteins catalyse or inhibit 
every chemical process in all organisms. Any alteration in the gene alters the function of the 
protein. Obviously the tertiary or quaternary structure has reaction site where other 
molecules get attached. The modification in genes is altering the structure of this all 
important functional site. Equally obviously the tertiary or quaternary structures are uniquely 
determined by the primary structure. Chris Levinthal estimated that the number of possible 
ways in which a long protein can fold is ~ 104”. So this folding must be totally defined by the 
sequence of amino acids. Christian Anfinsen (Nobel prize, Chemistry, 1972) experimentally 
confirmed this logic. He found that if a protein is reversibly denatured, it would always refold 
into the same 3D conformation. 
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Protein Structure 

Folding of a protein like all other processes in a living organism follows the laws of physics 
and chemistry. So the basic forces of interaction between the atoms in a protein are known. 
The protein chain is covalently bonded and the protein folding is caused by weaker 
electrostatic forces sometimes mediated by hydrogen bonding. Only then can the protein be 
denatured without disturbing the primary structure. But unlike in the case of other biological 
processes such as transcription of genes or functioning of vaccines, it has not been possible 
to divide protein folding into smaller steps that could be individually studied with scientific 
methods. 


So it has not been easy to know the final 3D structure of the protein from the known primary 
structure. It was also not easy to design a protein with the desired properties and build it 
using amino acids. One had to rely on the indirect method of modifying the genes and 
testing the consequences, very much in the spirit of natural evolution and traditional animal 
breeding. This year's prize celebrates the achievement of both goals. One wonders though if 
the 50 year old grand challenge in biology as an earlier Nobel laureate called it has been 
answered. 


Over the years, more progress has been achieved in protein design than protein structure 
prediction. This is obviously because smaller targets could be set. As far back as 1951, 
Linus Pauling logically argued that proteins will consist of helical patterns now called the 
alpha helices. Alpha helices and beta sheets are formed in all proteins and they are called 
the secondary structure. Using the structures of 15 proteins available in 1974, an attempt 
was made to estimate the probability of a specific amino acid being in the helix or sheet. 
From this an approximate idea of the final protein structure could be obtained for any other 
protein. It was also known that some of the side chains of amino acids are hydrophobic while 
others are hydrophilic. Since the medium in living cells is water based, it is reasonable to 
expect that hydrophobic parts will be in the interior of the protein after folding and that the 
hydrophilic ones on the surface. So, a helical structure with the internal surface being 
hydrophobic and external surface hydrophilic could be extremely stable. It was possible to 
design artificial proteins of this type and confirm that they are stable. But these are not 
realistic usable proteins. 


As scientists tried to design and produce more complex proteins, it became obvious that 
these simple rules of thumb and phenomenological relations like the Lennard Jones potential 
will not be sufficient. In the mid nineties experimental procedures for routinely determining 
protein structures became available and the number of known protein structures increased 
rapidly. It now became possible to correlate parts of the 3D structures of different folded 
proteins with the amino acid sequences and use computer programs to design proteins. 


Comparison of the naturally occurring Zn ion stabilized protein and the artificial one 
An important step was the fabrication of an artificial protein with about 23 amino acid 
residues. The naturally occurring protein is stabilised by zinc ions. An artificial protein 


stabilised into the same 3D structure without the metal ions was predicted by a computer 
program and successfully produced in the laboratory. Incorporating multiple empirical and 
phenomenological relationships based on known physics and chemistry into computer 
programs became the accepted methodology for protein design. 
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David Baker and his colleagues developed the Rosetta program to generate possible 
structures for larger proteins and also estimate their energies to determine if they are stable. 
In 2003 they successfully produced in the laboratory a completely artificial protein (there is 
no similar naturally occurring protein with that folding sequence) predicted to be stable by 
the program. It was 93 residues long, much larger than for example Insulin which has only 
51 residues. 


Baker and his colleagues produced new enzymes to act as catalysts with this computer 
program. Catalytic action has been demonstrated though the sensitivity is not very large. 
The team was extremely successuo in designing proteins for targeted delivery of medicines. 
Steroids for example could be attached to one side of the protein which when introduced into 
the living organism preferentially binds to a target. 


The CASP experiments (Critical Assessment of protein Structure Prediction) founded by 
John Moult and Krzysztof Fidelis in 1994 allowed truly blind predictions of protein structures 
to be made by computer programming and compared to new experimental structures 
determined by X-ray crystallographers and NMR spectroscopists, who withheld publishing 
their data until after the submission deadline of the CASP entries. The programmers used 
genetic algorithms, molecular dynamics simulations, Monte Carlo methods and ANNs. When 
the number of experimentally determined structures started to increase rapidly, it became 
possible to do multiple sequence alignment which contains information about pairs of amino 
acids in contact with each other in 3D. However, for twenty years, till 2016, the accuracy of 
prediction of protein structures was quite low. In 2016, it was realised that the earlier 
methods of correlating mutations could not distinguish between directly and indirectly 
correlated residues. Improvement of protein structure prediction using machine learning and 
neural networks to a precision of over 45% as measured by a global distance test (GDT) 
became possible. 


In 2018 the DeepMind founded and led by Demis Hassabis, constructed a computer 
program based on a convolutional neural network, AlphaFold1 which achieved an accuracy 


of 70%, high enough to allow translation of the contact or distance maps into 3D structures. 
This was clearly far superior to all other programming efforts up to that time. Hassabis and 
his team had also published the AlphaZero program, also based on deep learning, that 
showed unsurpassed performance Go, Chess and other two-party games. In 2020, 
DeepMind achieved the accuracy competitive with experimental structures (GDT score of 
over 90%) for a majority of targets, with a completely new AlfaFold2 (AF2) program. The 
AF2 team led by John Jumper and Hassabis had thus finally succeeded in solving the 
protein structure prediction problem for monomeric proteins to within a backbone accuracy of 
about 1 A. 


The AF2 unlike AF1 does not use the convolutional neural network but what is called a 
transformer architecture. It does not identify correlated mutations either. It works 
simultaneously with a 2D matrix of aligned sequences and a representation of pair 
distances. The two update each other during the learning process. Another module directly 
operates on a 3D backbone structure whose geometry is defined in terms of triangles formed 
by the NC a-C atoms of each residue and float around freely as rigid bodies moved by the 
network to form the structure. The similarity with the dense Hopfield network and restricted 
Boltzman machine which drifted away from physics principles, discussed as part of the 
physics award is obvious. The success of Rosetta was also due to the specific empirical 
relationships used. But no new “principles” of physics, chemistry or biology were uncovered. 
The justification of this program also is merely its success. When the ANNs developed by 
the DeepMind company proved superior, the Rosetta team unhesitatingly shifted to the new 
routines. 

The Nobel award committee offers this “ extremely vague guess” as the probable 
explanation for the program’s performance. “It (AF2) has effectively learned a potential of 
mean force (free energy surface), i.e., probability distributions for interatomic distances 
between pairs of atom types. There is thus a direct connection to the physical principles of 
protein structure, where the “knowledge” acquired by the program can be used to accurately 
determine structures.” This is no different from the claim that “physics has been a driving 
force underlying inventions and development of ANNs’. 


The AF2 architecture can of course be described as an ingenious piece of neural network 
engineering by Jumper, Hassabis and their coworkers. It can be viewed as the first real 
scientific breakthrough of artificial intelligence. To reward the pioneers of this type of 
mathematical wizardry is justified. It is intellectually more honest to accept that 
mathematicians have got the Nobel prizes as they deserve to, when they condescend to 
work in usable, non esoteric mathematics just as an engineer without even a Ph. D. had 
been awarded the Nobel prize in Medicine because he helped develop the CAT scanner. 
The mathematics had their day of frustration when only a computer could prove that four 
colours are sufficient to paint any map. In that case one could argue that the problem was in 
principle solvable by humans and only the number of possible map configurations that had to 
be tested was too large. As is obvious from the above analysis, “ exactly how” an artificial 
neural network learns is just as obscure as “ exactly how” human memory works. Perhaps 
the real analogy should be with quantum theory where the mathematical equations work and 
are thus used even though any description is counter intuitive. 


